Phoneme-based recognition for the Norwegian SpeechDat(II) database
Abstract
This paper presents results from a number of flexible vocabulary recognition experiments on the Norwegian SpeechDat(II) database. A common phoneme-based recogniser design procedure is tested on five different tasks, and for five different training sets. Results verify that reasonably accurate recognisers can be built with the database, using standard HMM techniques. They also quantify the importance of training set selection for small and medium vocabulary tasks.
Similar resources
Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering
The paper describes our ongoing work on crosslingual speech recognition based on multilingual triphone hidden Markov models. Multilingual acoustic models were built using two different clustering procedures: agglomerative triphone clustering and tree-based triphone clustering. The agglomerative clustering procedure is based on measuring the similarity of triphones on a phoneme level where the m...
Data driven generation of broad classes for acoustic model
A new data driven approach for phonetic broad class generation is proposed. The phonetic broad classes are used by tree based clustering procedure for node questions during the context dependent acoustic models generation for speech recognition. The data driven approach is based on phoneme confusion matrix, which is produced with the phoneme recogniser. Such approach enables the data driven met...
Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models
Grapheme based mono-, cross- and bilingual speech recognition of Czech and Slovak is presented in the paper. The training and testing procedures follow the MASPER initiative that was formed as a part of the COST 278 Action. All experiments were performed using Czech and Slovak SpeechDat-E databases. Grapheme-based models gave equivalent recognition performance compared to phoneme-based models in ...
Conversion from phoneme based to grapheme based acoustic models for speech recognition
This paper focuses on acoustic modeling in speech recognition. A novel approach how to build grapheme based acoustic models with conversion from existing phoneme based acoustic models is proposed. The grapheme based acoustic models are created as weighted sum from monophone acoustic models. The influence of particular monophone is determined with the phoneme to grapheme confusion matrix. Furthe...
FRESCO: the French telephone speech data collection - part of the European SpeechDat(M) project
This paper describes the design, collection and postprocessing of the French SpeechDat corpus FRESCO. Being a database of approximately 35,000 utterances recorded from 1000 callers over the terrestrial telephone network in France, it comprises immediately usable and relevant speech for the initial training and assessment of speaker-independent phoneme-model or word-model based speech recognizers...